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Abstract 

An improved noise canceling microphone is provided including robust design features and advanced 
noise control and speech discrimination convergence characteristics. Two adaptive controllers are used to 
ensure robust performance in quickly changing acoustic environments ensuring an acceptable minimum 
performance characteristic. AdditionaEy, a new real-time spectral estimation procedure is applied to a noise 
canceling communication microphone platform that permits continued and optimal adaptation of non-voice 
bandwidth frequencies during speech transients. 

Brief Description of the Figures 

Figure 1 is a block diagram of a general implementation of a dual adaptive filter for a noise canceling 
microphone that ensures a minimal performance equal to that of a passive noise canceling microphone. 

Figure 2 is a block diagram of an instantaneous convergence control of an adaptive filter in response 
to controller output power. 

Figure 3 is a general depiction of a frequency domain adaptive controller and its associated 
convergence control 

Figure 4 is a specific implementation of the frequency domain adaptive controller and the ftre'quency- 
dependent convergence control. 

Figure 5 is a block diagram of the combination of dual adaptive filtering and frequency domain 
adaptive filtering. 

Figure 6 is a depiction of two omni-directional microphones situated as a active noise canceling 
microphone. 
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Detailed Description and Preferred Embodiments 

The following paragraphs provide some background and prior art information in order to illustrate 
the specific characteristics of noise canceling microphones that are improved by this invention. Prior 
inventions have failed to provide for design robustness and the wide noise suppression bandwidth required 
for clear communication in high ambient noise fields. This invention focuses on providing a new noise 
canceling microphone using controller and algorithmic features that drastically improve the performance over 
the prior art noise canceling microphones. This application relies on the provisional Patent Application 
Serial No. 60/242,952 filed October 24, 2000, with inventors Michael Vaudrey and WiUiam Saunders entitied 
"Improved Noise Canceling Microphone". 

To review the nominal field being considered, it is recalled that passive noise canceling microphones 
typically incorporate a single membrane to sense ambient sound, where the housing of that membrane is 
open to the environment on both sides. Far-field sounds impact the membrane (essentially) equaUy on both 
sides, generating no net movement, and thus a low sensitivity. Near field sounds (such as when the 
microphone is placed close to a speaker's mouth) cause the membrane to move more significantiy in one 
direction tiian anotiier, causing a higher sensitivity. This higher sensitivity to close-range voice versus lower 
sensitivity to far- field ambient noise, provides a low frequency improvement in the signal-to-noise ratio 
because of the associated far field noise rejection; thus improving low frequency speech intelligibility. There 
are a multitude of patents tiiat cover tht passive noise canceling microphone concept in various ways 
including: 4258235, 3995124, 5329593, and 5511130 among others. The microphone invention described 
here is an active microphone and is therefore different from this prior art regarding passive elements. 

A second category of noise canceling microphones will be referred to as active noise canceling 
microphones. The most rudimentary active noise canceling microphones perform identically to die passive 
noise canceHng microphones mentioned above. The structural difference is that an active element such as a 
subtraction circuit is employed in order to electronically difference two microphone signals, in order to 
generate the noise canceled output signal. The two microphones are positioned facing away from each other, 
where one is directed toward the desired signal source, or speaker's mouth. There are patents focusing on 
the use of active elements in creating a noise canceling microphone including 5303307 and 5511130. The 
algorithms and design features presented herein are not anticipated by any of this prior art 

More advanced implementations of noise canceling microphones have arisen as a result of increased 
DSP processing capabilities, the present invention included. These adaptive active noise cancellation 
microphones typically include tiie use of an adaptive filter as part of the active canceling element and provide 



improved performance over bodi die passive and active noise canceling microphones. The invention 
disclosed herein is significandy different from die prior art in diis area as evidenced below. 



Patent number 5917921 by Sasaki et. al. is a very general embodiment of an adaptive active noise 
canceling microphone. Sasaki uses an adaptive filter widi two microphone signals to reduce die noise in one 
of those signals, using die odier as a reference input to die adaptive filter. The inventive elements described 
in the present invention are not described or anticipated by die disclosure of Sasaki, which only focuses on 
the general idea of using an adaptive filter widi two microphones for die purpose of reducing wind noise. 
The specific embodiments described by diis invention are not anticipated by Sasaki. 

Patent number 5953380 by Ikeda focuses on a very specific mediod for controlling die convergence 
parameter of the adaptation process as a fiinction of the two input signals. A complex series of delays and 
power estimations creates a single convergence parameter for die time domain adaptive filter. This single 
convergence parameter is varied widi die detection of die speech, as determined by the "SN power ratio 
estimation". Ikeda does not anticipate die present inventions because die need for robust performance in a 
physical product is not discussed; nor does Ikeda anticipate die concept of multiple frequency-dependent 
convergence parameters or the use of frequency-domain adaptive control. 

Patent number 5978824, also by Ikeda, is an adaptive filtering mediod for creating a "clean" estimate 
of the noise as weU as a "clean" estimate of the desired signal. The two adaptive filters create estimates of die 
desired signal and die noise signal, which are independendy used to generate convergence parameters for die 
two adaptive filters. The two adaptive filters used in Ikeda's invention are used to a) generate a more 
accurate estimate of die signal to noise at any given time and b) create more accurate estimates of die speech, 
as weU as the noise. The two adaptive filters used in die present invention provide an entirely different 
effect focused on improving robustness during quickly changing ambient noise disturbances; in addition, the 
arrangement of die adaptive filters in the present invention is completely different from and is not anticipated 
by Ikeda. 

Patent number 5473684 by Bardett and Zuniga describes two first-order differential microphones 
that are used to create an adaptive second-order differential microphone. The present invention uses two 
omni-directional microphones to create a single, adaptive, first-order differential microphone. The use of 
omni-directional microphones simpHfies die physical construction of the microphone assembly, since bodi 
transducer backplanes can remain secured in die housing. (FOD microphones must be open on bodi sides 
in order to be effective). No mention by Bardett is made concerning die use of two adaptive filters for 
optimizing the robust control of ambient noise. In addition, no mention is made of using a frequency 
domain adaptive algoridim for controlling multiple convergence parameters of individual frequencies. 



Patent number 5473702 by Yoshida et al. controls the adaptation of the adaptive noise-canceling 
filter by adjusting the convergence parameter as a function of the error signal. There are several options that 
are discussed through a complex rule-based system that vJtimately decides when the algorithm should 
temporarily cease adaptation. Frequency domain control of adaptation is not anticipated by Yoshida^ nor is 
5 the use of two adaptive filters for robust performance of a two-element adaptive noise canceling microphone 
design. 

Finally, patent number 5319736 by Hunt describes a digital signal processing system that creates a 
frequency spectrum of speech from noisy speech to be used by a speech recognition system. This system 
does not anticipate using multiple adaptive filters as disclosed herein. In addition, Hunt's system does not 
10 anticipate performing real-time frequency domain adaptive filtering for communication microphone 
applications. Instead the output of his system is used as an input to a frequency domain vocoder. 

In summary, this review of the prior art in adaptive noise canceling microphones directiy points to 
the need for a more robust design of an adaptive noise-canceling microphone where the minimum 
\.Q performance is at least as good as passive noise canceling microphones at all times and the maximum 
. J5 performance can far exceed that of the existing noise canceling microphones. Tests have shown that in 
I y highly reverberant environments, the passive noise control microphone design can perform better than the 
^■y pnor art adaptive noise canceling microphones discussed above, if safeguards are not appKed. The dual-filter 
, embodiment of this invention disclosed herein is such a safeguard that ensures tiie adaptive noise canceUng 

i;f; microphone will always perform at least as well as the passive version, thereby improving the robustness of 
;'-^0 any noise canceling microphone previously described in the prior art. 

The second failing of the prior adaptive noise canceling microphone designs is that fast variations in 
the noise field cannot be tracked when die adaptive filter has a small convergence coefficient This problem 
leads to increased average noise levels for the adaptive filter arrangements discussed by others. The first- 
25 stage, single-weight adaptive filter of the present invention eliminates the degradation associated with fast 
tracking of noise field variations. 

Finally, the prior art does not anticipate the need for firequency domain adaptation. This is a 
problem for all of those previously discussed inventions because the adaptation of the entire filter is halted at 
30 every frequency every time tiiere is a component of speech detected. This leads to sub-optimal wideband 
noise suppression. The solution offered by the present invention is to only adapt individual frequency bins, 
allowing non-speech, noise frequencies to be adapted while simultaneously halting adaptation for those 
frequency bins dominated by speech content. Detailed descriptions of the invention are provided next. 
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The invention disclosed as embodiments herein improves the performance of existing adaptive noise 
canceling microphone designs. The first improvement (which can be used simultaneously with the second) 
uses dual adaptive filters. The first adaptive filter acts as a single-weight gain calibrator to equalize two omni- 
directional microphones so that their subtraction is optimized to minimize the error output. Because this is 
only a single element adaptive filter, the output is the same as a tuned active noise canceling microphone, but 
achieved with minimal algorithmic complexity. The second adaptive filter is then used to perform the 
broadband noise control, focused primarily on high frequency ambient attenuation. The second design 
improvement creates an automatically adjustable convergence parameter for each frequency bin in the 
spectrum. Since speech formants can be tonal in nature, it is advantageous to continue to adapt components 
of the spectrum that do not contain speech, even during speech segments. By performing the adaptive 
filtering in the frequency domain, each weight update can be independentiy controlled by adjusting its 
respective convergence parameter. 

The first critical component of this invention is the microphone architecture. It is more 
advantageous from a performance and implementation standpoint, to use two omni-directional microphones 
situated as shown in Figure 6. Bartiett et. al. in (5473684) discussed the use of two first-order differential 
microphones to form a second-order differential microphone. Structurally, this is a difficult assembly to 
construct since both microphones must have the back and front open to the acoustic environment. This 
increases the distance between the membranes thereby decreasing high frequency coherence between the two 
microphones. As coherence decreases, performance of the adaptive feedforward controller also decreases. 
Therefore, it is essential to this invention that the transducer unit consists of two omni-directional 
microphones. Referring again to Figure 6, the first omni-directional microphone (49) is situated close to the 
speaker's mouth or the desired source (52) while the second microphone (48) is facing 180 degrees away 
from the first. Assuming the microphones are identical and have equal sensitivities, the amplitude of the 
voice (52) will be greater as measured by the close microphone diaphragm (51) than the amplitude measured 
by the second microphone diaphragm (50), Alternatively, the amplitude of the ambient noise (53) will be 
measured nearly equally by both diaphragms. Using omni directional microphones as in (48 and 49), the 
backs of the elements remain closed and can therefore be placed direcdy adjacent to each other in the 
microphone housing. This closer proximity serves to increase the broadband coherence between the two 
microphones, thereby improving the noise attenuating performance as compared to previous inventions. 
Furthermore, omni-directional microphones have a nearly flat frequency response, ensuring accurate 
reproduction of both the noise and the speech for improved low frequency control performance. This 
configuration of two omtn-directional microphones is used throughout the remainder of this discussion 



where the reference signal (adaptive filter input) is the microphone facing away from the speaker and the 
communication microphone is facing toward the speaker's mouth. 

The first part of this invention can be understood clearly by examining Figure 1. There are two 
omni- directional microphones (1 and 2) daat detect two different signals (c and r respectively) in the physical 
arrangement specified above. When ambient noise in the environment is detected by the microphones, it is 
detected almost equally by both the 1 and 2 microphones (so that c==r). However, when the person speaks, 
since microphone 1 is closer to the mouth, microphone 1 has a higher amplitude of speech than microphone 
2, even though both microphones also are continuing to detect the ambient noise at similar levels. A simple 
subtraction of microphone 1 from microphone 2 represents the concept of an active noise canceling 
microphone where the difference results in more speech than noise (since the noise content is approximately 
the same on both microphones). When using two omtii directional microphones, a simple subtraction may 
not be sufficient for exact cancellation of the noise signal. This may be due to the microphones having 
slighdy different sensitivities, an obstruction, or a variation in preamplifier hardware characteristics. It is 
therefore necessary to incorporate a variable gain in order to compensate for these variations. 

In general, the variations in omni- directional microphones will not be frequency dependent, but 
rather gain related. Therefore, the adaptive filter (3) will be implemented using a single weight, w, to control 
the gain variations between microphone 1 and 2. The resulting signal is: 

si = c — w*r 

where, 

Wk+i = Wk + mu*r* si 

and the subscript on the adaptive weight refers to the iteration number. After a sufficient number of 
iterations transpire, the signal si will be minimized by the gain w. The resiilting s%nal, si, is equivalent to that 
of an optimized active noise canceling microphone. However, die difference is tiiat the tuning of the relative 
gain between microphone 1 and 2 is performed automatically by the adaptive filter. 

Continuing on with Figure 1 and the embodiment description, si is used as the error s%nal to the 
next adaptive stage enclosed by the dotted line in the right side of Figure 1. The microphone 2 signal is used 
as the reference signal in the second adaptive filter (5). This adaptive filter is designed to have as many 
weights as is practical for the particular DSP implementation and desired bandwidth (typically up to 4 kHz 
for speech). This adaptive filter performs an optimal minimization of the signal sz by subtracting (6) any of 



the noise in signal si remaining from the first adaptive process. Before the specific advantages are noted, 
Figure 2 illustrates one further detail that is disclosed as part of this invention. 

Each adaptive filter operates on the premise of minimi2ing its respective error signal. During 
moments when the speaker is active (speaking), the optimal solution to minimizing the error must change to 
compensate for the new direction of the "noise'' source. In fact, we do not want to cancel the voice, only the 
noise. Therefore, it is required that we prevent adaptation of the adaptive filter during time segments when 
voice is present. In order to instantaneously identify those time segments in real time, we need only to look 
at the output power of the error signal (output of 4, 6 or 7). Figure 2 illustrates the method that is disclosed 
for controlling adaptation as a function of the voice. The output of the adaptive filter (8) is subtracted from 
(7) the input signal to create the error signal that is used to update the adaptive filter. During periods of 
quiet, this error signal is mitiimized below a certain level threshold (11). The error s^al is continuously 
compared (10) to the fixed threshold value (11) and if it is below the threshold then adaptation continues as 
determined by a switch (9) that controls the convergence parameter mu in the adaptive weight update to be 
some nonzero constant "a". If the error s%nal instantiy rises above the threshold, then the comparator 
signals the switch (9) to set the convergence parameter to zero, ceasing adaptation on the speech. 
(Optimizing this operation as a function of frequency is discussed as the second part of this invention in 
subsequent paragraphs). 

The process of Figure 2, where the convergence parameter controls update of the adaptive filter as a 
function of the presence of voice, is required in order to prevent cancellation of the voice signal by the 
adaptive filter. It should be noted that prior art does not disclose a method for controlling the convergence 
parameter strictiy as an instantaneous function of the error signal used by the adaptive filter. Figure 1 illustrates 
the first exemplary embodiment of this invention. Incorporating the convergence control of Figure 2 into 
each of tiie adaptive filters (3 and 5) of Figure 1, a distinct advantage over the prior art is seen. The method 
of controlling the convergence rate instantaneously increases the response time of the adaptive filter to 
speech transients, as well as reduces computational load that is seen when incorporating an average or mean 
calculation over a period of time. 

If the prior art adaptive noise canceHng microphone is tested in noise environments having high 
reverberation times, it will be seen that the overall noise reduction performance can be less than that of a 
simple passive noise canceling microphone. This is due to the fact that the coherence between two 
microphones in a highly reverberant environment can be less than that in an anechoic envitronment. The 
performance of an adaptive filter in a feedforward control arrangement is a direct function of the coherence 
between the reference and the disturbance measurement. The new dual adaptive filter arrangement shown in 
Figure 1 solves this problem. By using a single weight adaptive filter and subtracting the reference (r) from 



the communication microphone (c), the exact performance of the passive noise canceling microphone is 
achieved as the signal Si. At this point, regardless of the coherence between the signal si and r (determining 
the performance of the second adaptive stage), we can be assured that the minimum petfotmance 
achieved will be at least equal to that of the passive noise canceling microphone. (Note that the single 
5 weight adaptive filter is less dependent on the broadband coherence between r and c, and primarily focuses 
on the very low frequency coherence which is typically high even in reverberant environments). In certain 
cases, the second adaptive filter (5) may offer no performance and S2 will be equal to si, which is precisely the 
performance of the passive (or tuned active) noise canceling microphone. 

This invention provides a new level of robustness in the adaptive noise canceling microphone design 
10 that is not anticipated by any of the prior art. This invention ensures that the worst (adaptive) 
performance that can be expected is no less than that of a passive noise canceling microphone. It 

should be emphasized that the first adaptive filter is only a single weight and acts as a caKbration gain to 
optimally match the levels between c and r to mhmmzt the mean squared error. Larger adaptive filters (3) in 
i,Q the calibration location wiU suffer the same difficulty in suppressing noise as (5) if the coherence is too low 
5 between the inputs. 

\ J As noted earUer, the successful adaptation of (3) relies on the coherence between the signals at (1) 

■ w. 

'lU and (2). There may be instances when it is advantageous to only adapt the first adaptive filter (3) of Figure 1 

. for a short time before fixing the calibration gain (3) because of poor coherence between the two inputs. This 

can be easily accompEshed by using a timer that sets the convergence parameter mu to zero after a specified 
,'^20 period of time. This ensures that the communication signal used in the second adaptive stage of Figure 1 
^.y (enclosed by dotted line) is always equal in performance to that of a fixed active noise canceling microphone, 
while stiU having the benefit of having a correlated reference signal. It should be noted that this 
configuration is not possible via Bartiett's invention (5473684) because he uses two first-order differential 
microphones. It should also be noted that omni- directional microphones are much more convenient to 
25 implement (physically) since theit: backs can remain closed. This improves their local proximity to one 
another because they can be situated directiy adjacent to each other. In highly reverberant fields this is a 
distinct advantage for high frequency control since there is a higher coherence between the signals, the closer 
the microphones get. Therefore, for the optimal positioning of Figure 6, the configuration of Figure 1 is the 
only way to provide a minimum of passive noise canceling performance using two omni-directional 
30 microphones in an adaptive controller. 

A further improvement in noise canceling microphone performance derives from the use of 
frequency domain adaptive filtering (FDAF). FDAF is a method for designing adaptive filters and adaptive 
controllers that performs the weight update in the frequency domain. The adaptive noise canceling 



8 



microphone is a particularly suitable appUcation for FDAF because of the inherent dependence on frequency 
domain characteristics of both the speech and noise. In general, the ambient noise to be canceled by a noise 
canceling microphone will usually be broadband or random in nature. Speech elements can be very 
narrowband, or at times broadband. As mentioned earlier, it is desirable to cease adaptation of the adaptive 
5 filter during times when there is speech so it is not canceled. Figure 2 illustrated one possible way to perform 
this switching adaptation as a function of output power for a single convergence parameter. 

All prior art implementations of such a convergence parameter have focused on time domain 
control When using the LMS algorithm in the time domain, only a single convergence parameter can be 
used. If a vector of convergence parameters were proposed for the time domain LMS algorithm, there 
10 would be no logical way to control tiieir state. Furtiier, since prior art has only proposed time domain signal 
power control, all of these metiiods cease adaptation of the ENTIRE adaptive filter each time the signal 
power exceeds a certain threshold. It should be clear that since speech can be narrowband in its spectral 
content, it is not necessary to stop adaptation of die ENTIRE adaptive filter, but only the parts tiiat are 
^=0 affected by the speech signal itself Therefore, it is clear tiiat this frequency domain implementation of the 
^- JL5 convergence parameter offers improved performance opportunities. 

Id Frequency dependent convergence as described here is impossible to accomplish in the time domain. 

Therefore the invention disclosed next is to provide a frequency domain adaptive filter used in a unique 
adaptive noise canceling microphone arrangement so that individual segments of the noise bandwidth can 
continue to adapt while die segments of the speech bandwidtii are fixed during speech. This is accomplished 
1:320 using the microphone and algoritiim construction shown in Figures 3 and 4. 

^ Figure 3 is a general block diagram showing two microphone signals (12 and 13) entering tiie 

frequency domain adaptive controller 14 that generates the output 16. The output is the cleaned speech 
signal or error signal to be minimized. As witii the time domain structure, the convergence of die adaptive 
filter is controlled by selectively turning off tiie convergence parameter mu as a fiinction of the output power 
25 of the adaptive controller. This is accompUshed generally dirough die frequency domain convergence 
control (15). As mentioned earlier, die primary difference (and key advantage) here is tiiat die convergence 
can be controlled as a function of frequency. 

Figure 4 illustrates a more detailed implementation of Figure 3 in an unconstrained frequency 
domain adaptive filtering format. The communication microphone signal (17) has the control signal (output 
30 of 23) subtracted from it in tiie time domain to produce die output (or error) signal 39. To perform the 
adaptive filtering operation in die frequency domain, care must be taken to prevent circular convolution. It 
should be noted that Figure 4 illustrates circular correlation in tiie computation of die weight update and is 
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therefore known as unconstrained adaptation. The inventive feature is the control of the convergence 
parameters. To prevent circular convolution during the filtering operation in the frequency dotnain, two 
block sizes are concatenated with each other (19) before the fast Fourier transform (20) is taken of the 
reference input This reference is then multipUed (21) by the adaptive filter weights in the frequency domain 
to create a filter output that is inverse fast Fourier transformed (22) and appropriate samples are taken as the 
block output (23). The output or error signal (39) is concatenated with appropriate zero padding before the 
FFT (30) is taken and the correlation is computed (29) for the weight update. 

A critical part of this invention enters at the multiplication (28) of die convergence parameters by the 
correlation of the tap input vector and the error signal. The convergence parameters are formed as a 
fiinction of frequency and stored in a vector alpha_bar (32). This is accomplished by first taking the FFT 
(37) of the instantaneous error signal (39). The power in EACH of the spectral bins of this FFT is then 
compared (36) to either one of two stored vectors. The first possibility is a manually entered predetermined 
set of magnitude threshold values (as a fianction of frequency) diat represent the controlled spectral bins of 
the noise level of signal 39 when no speech is present. The second possibiUty is diat the controlled spectrum 
is stored during a time when no speech is present, which represents a typical controlled output spectrum. 
Either vector (which is a threshold magnitude as a function of frequency) should contain nearly the same 
values. On a frequency bin-by-bin basis, the magnitude of the output of (37) is compared (36) with the 
stored magnitude of (35) the threshold values and a decision is made to choose either 34 or 33. This 
comparison operation is typicaUy accompUshed through a "if statement in a software code, but can also be 
implemented using FFT and comparator hardware components. If the magnitude of the actual signal 
(output of 37) in a bin is greater than the stored tiireshold (35) in tiiat same bin, then there is speech in that 
bin and the convergence parameter for that bin (vector location) is chosen to be zero (33). Likewise, if the 
actual bin measurement is lower than the stored tiireshold, a nonzero adaptation constant "a" (34) is chosen 
for that respective element of the vector alpha_bar. After each frequency is examined, tiie vector alpha_bar 
will consist of a series of zeros and nonzero constants "a", where tiie zeros reside in all spectral bins whose 
magnitude was greater titian the stored tiireshold values. This vector is tiien multipHed by tiie identity matrix 
(31) and the result is multiplied (28) by tiie correlation. Finally, the current and fixture (25, 26) frequency 
domain weights are computed and multipUed by tiie input tap vector (21). These steps are repeated each 
time a new input and error block is accumulated. 

It should be clear from tiie above discussion tiiat the convergence parameters can vary witiiin one 
iteration as a fianction of frequency. This is a critical advantage over the prior art, because adaptation of the 
filter can continue in bins tiiat do not have speech in tiiem. In particular, it is unusual to have speech 
formants at frequencies below 200 Hz for most speaking voices. Therefore, it is possible, using tiie invention 
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